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Introduction 


This concept — that we 
can’t identify a great 
teacher on the basis of 
her resume — should be 
liberating. But for some 
reason, it’s largely ignored 
in the education field. 


T he single best predictor of who will be a great teacher next year is who was a great 
teacher this year. 

The second best predictor is... Well, there really isn’t one that’s close. Some great 
teachers are seasoned veterans, while some are new to the field. Experience matters, of 
course, particularly in the early years of a teaching career, but it’s no guarantee of teacher 
effectiveness; there is a wide variation in performance, even among teachers with the 
same amount of experience. Some great teachers went through a traditional preparation 
program with a standard student-teaching experience. Others perform just as well with 
barely any prior training. Some great teachers are Ivy League grads, but others come from 
their local state colleges. The list could go on. 

This concept— that we can’t identify a great teacher on the basis of her resume— should be 
liberating. But for some reason, it’s largely ignored in the education field. Instead, without 
worrying too much about whether requirements will make teachers more effective, each 
state imposes myriad conditions on potential teachers and the programs they must attend 
to be eligible to teach. 


The result is requirements that amount to little more than barriers— new ways to 
keep potential teachers out of schools. These requirements, from licensure tests to 
minimum GPAs and SAT scores, from rules on student teaching to teacher-performance 
assessments, do not guarantee effective teachers. For some of these requirements, 
the research reports mixed results. Other requirements are too nascent for definitive 
conclusions, while still others are simply useless. 
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Candidates spend billions 
of dollars and thousands 
of hours on teacher 
preparation requirements, 
but there's little evidence to 
justify them. 


What's more, even if we knew the right inputs— the right requirements that candidates 
need to meet to become a teacher— we couldn't be sure they were well implemented at 
each of the 26,589 teacher preparation programs in 2,171 colleges, universities, and other 
education providers across the country. 1 That dispersion makes it challenging to create 
effective policies for teacher preparation. 

In response, the latest efforts to improve preparation shift away from input requirements 
for candidates and their programs and toward each program’s outcomes. Policies then 
become less about defining the “how” or "who” of teacher preparation and more about 
defining the end result— student learning growth, for example, or teacher placement rates. 
This shift is an improvement, but it is fraught with challenges when it comes to deciding 
which measures determine success and if those measures can distinguish programs from 
one another. 


This paper attempts to trace what we know about identifying and training successful 
teachers. It starts with a sobering conclusion: Although candidates spend billions of 
dollars and thousands of hours on teacher preparation courses, we don’t yet have a body 
of evidence justifying those requirements. Nor do we know how to measure and define 
a successful teacher training program. There is not yet any magic cocktail of program 
and candidate requirements that would ensure all teachers are great before they begin 
teaching. As such, policymakers should invest much more time and resources into learning 
about the science of teaching and how individual teachers develop their skills. And in the 
meantime, policies should reflect the fact that we know far more about a teacher after 
they enter the classroom than beforehand. 
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We Don’t Know How to Train Good Teachers 


The best available evidence 
suggests that we don't know 
how to train good teachers, 
no matter what stage 
they’re at in their careers. 


A s a country, we make an enormous investment in training teachers. We estimate 
it costs $24,250 to train the average teacher candidate, and each candidate 
will spend an average of 1,512 hours in training 2 — more hours than the typical 
American teacher works over the course of a school year— to meet the entry requirements 
to become a teacher. 3 Collectively, we estimate that over the course of just one single year, 
new teachers will invest $4.85 billion and 302 million hours on their preparation. The real 
cost is even higher because these estimates don't factor in the opportunity cost. Nor do 
these estimates consider training delivered to teachers once they begin their careers, which 
the nonprofit TNTP suggests can amount to another $8 billion annually. 4 


That’s a lot of time and money, and we make these investments year after year for all of the 
nearly 200,000 teachers certified each year. 5 This investment should be producing effective 
teachers, but it's not. The best available evidence suggests that we simply don’t know how to 
train good teachers, no matter what stage they’re at in their careers. 

Historically, the education sector has relied on inputs— things like admissions criteria, 
required coursework, certification tests, years of teaching, and advanced degrees— to 
identify good, "highly qualified” teachers. But relying on inputs is risky, unless we’re sure that 
they are true indicators of quality. If not, candidates who may have been successful will be 
filtered out, and candidates who won’t ever be successful are allowed in. 
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There’s some evidence 
that GPA and SAT screens 
can predict teacher 
effectiveness, but the value 
of those credentials is 
relatively small, and they 
are not a guarantee for 
any individual. 


What's more, the inputs used for teacher training are unproven. Admissions criteria, required 
coursework, advanced degrees, and certification requirements tell us little about who is 
likely to be an effective teacher. These are all barriers to entry— put into place by states, 
preparation programs, accreditors, and the schools hiring teachers. But regardless of who 
set the requirements, no requirement ensures that only good teachers will reach students. 
And even after a candidate jumps through these hoops, we don't know how to provide the 
professional development and training that will improve her practice. Additional research, 
particularly into new measures such as teacher performance assessments like the new 
edTPA, may provide more insight into what works. But the research we have today doesn't 
support the current slate of requirements. 

It's a sobering, even humble, conclusion: At every stage of a teacher’s career we simply don’t 
know how to help her improve. 

We don’t know which candidates to admit 

Admissions criteria are the first barriers for most teacher candidates. There's a large national 
push for states to require high GPAs and SAT scores from students looking to enter teaching 
programs. The Council for Accreditation of Educator Preparation (CAEP),the accreditation 
body for educator-preparation programs, requires that providers set minimum GPA and 
admissions-test requirements for candidates. 6 CAEP’s minimum admissions scores are 
designed to rise overtime, reflecting a sense that teachers should be culled from the “best 
and brightest” candidates. 

But while there’s some evidence that these screens can predict teacher effectiveness, the 
differences in effectiveness between candidates who meet the requirements and candidates 
who don’t are small, and the research is conflicting. 7 Using data from New York City, 
researchers found a positive correlation between a teacher’s SAT math score and student 
achievement, but they found no relationship between student growth and a teacher’s SAT 
verbal score. 8 A 2015 study of a large urban district in the South did find positive effects for 
a teacher’s undergraduate GPA and attendance at a competitive college but concluded that 
"these characteristics together account for only a small portion— less than 2 percent— of the 
total variation in teachers’ value-added estimates, meaning that they are correlated with 
teacher value-added but only weakly.” 9 

While the evidence tilts toward greater teacher effectiveness for candidates with stronger 
incoming academic credentials (all else being equal), the value of those credentials is 
relatively small, and they are not a guarantee for any individual. This research also gives 
states no guidelines on where to set setting appropriate minimal standards that each 
candidate must meet. 
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We don’t know what coursework— if any— to require 


The “best practices” of 
teacher preparation often 
make intuitive sense, but 
aren’t backed by research. 


After being admitted to a preparation program, most candidates have to complete 
substantial coursework before they can teach. States require candidates to pass certain 
courses before they can be certified, but the courses vary from state to state, and the actual 
content varies widely— even among different course sections within the same program and 
among separate programs within the same university. There’s little research on how much 
or what type of coursework would help candidates become effective teachers, so programs 
generally rely on best practices, as defined by the field. These best practices often make 
intuitive sense, but they’re not backed by research. It’s logical to assume, for example, that 
additional coursework and more clinical experience should produce better teachers. But 
that’s not necessarily the case. 


There is no evidence to show that the length of a preparation program is linked to better 
student outcomes. In a rigorous Institute of Education Sciences (IES) study, researchers 
from Mathematica Policy Research divided alternatively certified teachers into two 
subgroups, high coursework teachers and low coursework teachers, according to the 
number of coursework hours that were required by each teacher’s certification program. 
The researchers found no evidence that students of high-coursework teachers performed 
differently than students taught by teachers with less coursework. And neither of these 
groups performed any differently than traditionally certified teachers . 10 Another study 
found that Florida teachers who completed an alternative-certification path that required 
a bachelor's degree but no additional coursework were more effective than traditionally 
prepared teachers. The study also found that candidates with higher levels of academic 
achievement were more likely to pick shorter training programs, which may create a false 
picture of how well these programs train future teachers . 11 Regardless of the explanation, 
there is no clear correlation between the number of courses a future teacher completes and 
her eventual success as a teacher. 


This finding largely holds true for different types of coursework. The Mathematica 
researchers also looked at the effects of additional content coursework, such as math 
or English courses, and additional education-specific coursework, such as classes in 
classroom management, child development, or student assessment. The researchers 
found no relationship between additional content or education coursework and student 
achievement. This isn’t to say that these skills don’t matter, but it does suggest that we don’t 
yet know how to teach them. Other studies have found similar results. Using data from San 
Diego Unified School District 12 and Chicago Public Schools , 13 researchers found no link 
between a teacher’s undergraduate major or minor and student performance, even if the 
teacher earned a degree in math or English. There are some exceptions— research shows, 
for example, that secondary math and science teachers with more content expertise are 
more effective 14 — but overall, coursework requirements are not correlated with student 
achievement. 
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State certification 
is the sum of the 
required components 
of teacher preparation. 
It. should predict 
teacher effectiveness, 
but it doesn’t. 


One solution to the problem of more coursework not producing better teachers is to change 
how candidates spend their time. They can spend less time on theory or content learning 
and more time on the practice of actual teaching skills. But this, too, isn't as simple as it 
may seem. Merely adding additional clinical fieldwork experiences, or increasing the time 
a teacher spends in clinical practice during her preparation program, is also not associated 
with increases in student achievement. From some research, such as a 2008 study of New 
York City, 15 we know that some student teaching experience is better than none. But more 
is not always beneficial. In the IES study, the Mathematica researchers found that the 
number of required fieldwork courses had no effect on student achievement. 16 In another 
study, researchers analyzed test data from Florida for all students in 3rd through 10th 
grades between 2000 and 2005. The research showed no relationship between additional 
classroom-practice requirements and student achievement. There’s other research 17 that 
student teaching programs that are controlled by the preparation program, rather than the 
K-12 school, produce higher quality experiences and more effective teachers. But again— that 
evidence is an argument for quality, not quantity. 

No one knows exactly why these inconsistencies exist. It could be that the training itself 
is valuable but candidates select programs that better fit their needs and goals. Highly 
motivated candidates may choose faster and less restrictive alternative-route programs, 
while other candidates may benefit from the longer training offered by traditional programs. 

It would be impossible to design a randomized controlled trial to examine this phenomenon. 
But regardless of the effects of program selection, there are trade-offs between requiring 
more training and allowing candidates to enter the teaching field with less training. So far, 
there’s no clear indication that more training is always better than less. 

We don’t know what the right certification requirements are 

Before entering the classroom as teachers, candidates must meet state certification 
requirements. States offer different types of certification on the basis of the candidate’s 
background and future teaching subject. For an initial teacher’s certificate, states 
generally require candidates to complete an approved teacher preparation program; pass 
content examinations, such as Praxis Subject Assessments; and undergo a background 
check. Some states also offer alternative and professional certifications for candidates 
who are less or more qualified. 

State certification is the sum of the required components of teacher preparation: 
coursework, clinical practice, and assessments. It’s reasonable to think that certification may 
capture something meaningful that predicts teacher effectiveness. But it doesn’t. Research 
shows that certification has no relationship to teaching ability. 

Historically, there’s been a bias against short alternative-preparation programs. After all, how 
can teachers be effective without comprehensive training? But teachers who spend extra time 
and money on traditional teacher preparation programs are not necessarily better teachers 
than those who start teaching and get training along the way. 18 A 2006 study of 9,400 Los 
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Once teachers enter the 
classroom, we don’t 
know what professional 
development improves 
their practice. 


Angeles teachers found no statistically significant differences among traditionally certified, 
alternatively certified, and uncertified teachers. 19 In another study, researchers examined 
the effects of initial teacher-certification status (certified, uncertified, and alternatively 
certified) on student achievement, using data from New York City. The researchers found 
that the differences among different certification statuses were minimal; average traditionally 
certified, alternatively certified, and uncertified teachers had similar effects on student 
achievement. Instead, the researchers found substantial differences within certification status. 
For elementary math teachers, the differences between the best and worst teachers who had 
taken the same route to certification was almost ten times as large as any differences among 
teachers who were sorted by certification type. 20 

In the IES study mentioned above, researchers conducted mini-experiments in which students 
in a single grade at one school were randomly assigned to either a teacher who was trained in 
an alternative program or a teacher who was trained in a traditional program. At the end of 
the trial, there were no statistically significant differences in the students’ test-score gains. 

The researchers concluded that it would be "very difficult to predict, based solely on route of 
certification, the outcome of students placed with a particular teacher." 21 

We don’t know how to help teachers improve once they 
begin teaching 

Once candidates enter the classroom as teachers, we don’t know what on-the-job 
professional development improves their practice. 

A substantial body of research shows that teachers’ effectiveness tends to improve during 
their first few years of teaching but then levels off considerably 22 On average, teachers with 
20 years of experience are more effective than teachers with no experience, but they’re only 
slightly more effective than teachers with five years of experience 23 And effectiveness varies 
within experience levels. There is substantial overlap in the effectiveness of teachers with no 
experience, some experience, and a lot of experience. 24 

This evidence aligns with what researches have concluded about professional development 
for teachers. Recent research from TNTP found that districts spend about $18,000 per 
teacher per year on professional development, an expense that adds up to $8 billion annually 
for just the 50 largest school districts in the country. Despite that massive investment, 
only 30 percent of teachers show noticeable improvement, and TNTP cannot pinpoint the 
professional development elements that led those teachers to improve. The authors note that 
"no type, amount or combination of development activities appears to be more likely than 
any other to help teachers improve substantially, including the ‘job-embedded,’ 'differentiated' 
variety that we and many others believed to be promising.” 25 High-quality, randomized 
controlled studies conducted by IES have come to similar conclusions about professional 
development in reading, 26 mathematics 27 and science. 28 

It’s depressing but true: We simply don’t know how to train teachers to teach, no matter what 
stage they are at in their careers. 
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The Look to Outcomes 


S tates, programs, and schools have long focused on inputs, which were thought 
to predict teacher effectiveness and were often the best option available. But in 
the mid-2000s, it became possible to evaluate preparation programs through the 
outcomes of their graduates. 29 No longer would policymakers have to impose rules that 
were essentially best guesses about what would make an effective teacher. Policymakers 
could measure which teachers were effective and then use the data to shape policy. 


Louisiana and Tennessee were the first states to try out this idea. In 2000, Louisiana started 
looking at preparation programs through their outcomes data. Between 2003 and 2006, 
the state began linking preparation programs to the student-learning data of their recent 
graduates, making the data available to the public in 2007. 30 Louisiana’s work suggested that 
it was possible to discern program quality from completer outcomes. 31 Tennessee began a 
similar initiative in 2007, when the state passed legislation that required an annual report on 
preparation-program outcomes. Louisiana’s and Tennessee’s efforts laid the foundation for 
national interest in linking outcomes to preparation programs. 

The U.S. Department of Education officially endorsed linking programs to outcomes in 
the summer of 2009, when Secretary of Education Arne Duncan and President Barack 
Obama announced the Race to the Top (RTT) competition. The $4.35 billion RTT program 
awarded points to states for a range of accomplishments, including 14 points for improving 
the effectiveness of teacher and principal preparation programs. Toward the end of 2009, 
Secretary Duncan gave a forceful speech at Columbia University in which he called out 
preparation programs for "doing a mediocre job of preparing teachers for the realities 
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of the 21st century classroom." 32 In his speech, Duncan lauded Louisiana’s work linking 
completer outcomes to preparation programs. 

RTT prompted a number of states to begin linking programs to outcomes. Forty states and 
Washington, D.C., applied for RTT funding in the first phase alone, 33 and nineteen states 
received grants throughout the three phases of the competition. 34 Not all of those states 
promised teacher preparation reforms along the lines of Louisiana and Tennessee, but 
many of them did, including Rhode Island, Florida, Ohio, North Carolina, Massachusetts, 
and Georgia. 

In 2011, the U.S. Department of Education took another step to encourage states to link 
preparation programs to outcomes. The department announced that it would begin the 
process of regulating Title II and Title IV of the Higher Education Act 35 to address teacher 
preparation accountability and reporting. Title II affects how states and institutions report 
on the quality of preparation programs and requires states to identify their low-performing 
programs. During the rulemaking process, the department pushed to include completer 
outcomes in states’ definitions of program quality. 

The regulation, which is still making its way toward a final rule, would require states 
to assess preparation programs on three performance outcomes: student learning 
(measured by student-growth or teacher evaluation results); employment (placement and 
retention rates, especially in high-need schools); and survey outcomes (of completers and 
employers). Although the rule is pending, if the final rule looks like the proposed version, 
all states will be required to link completer outcomes to preparation programs, beginning 
in April 2019, and to report the data publicly. 

The Promise of Outcomes 


Comparing outcomes 
would provide real 
evidence of results, rather 
than relying on anecdotal 
and largely unproven 
“best practices. ” 


Completer outcomes seemed to be the solution to at least a few teacher quality 
issues. The idea was— and is still— appealing: States could loosen preparation program 
requirements, give providers more freedom to design their programs as they saw fit, and 
then make decisions about programs on the basis of the success of their completers. If a 
program didn’t produce effective teachers, or produced teachers who didn’t get jobs, the 
state could warn the provider that it needed to fix the problem. If the provider couldn’t fix 
the problem, the state could close the low-performing program. States could learn from 
the highest performing programs, and target supports and consequences to gradually shift 
the teacher preparation market so that it offered only the best providers. 


Even if, at first, states can’t define productive interventions, outcomes allow them 
to differentiate among poor, satisfactory, and excellent alternative and traditional 
certification programs in a way that inputs do not. Theoretically, the data would reveal 
which programs produce the best teachers, and “consumers"— prospective teachers and 
potential employers— could act on that information. 
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Perhaps the most important short-term use for outcomes is providers' own continuous 
improvement efforts. Comparing outcomes across similar providers would give providers 
the evidence they need to change their program practice, rather than rely on anecdotal 
and largely unproven “best practices.” Tracking and publicly reporting outcomes would 
increase transparency and reduce information gaps so that providers could learn from 
others in the state. 


Bringing Outcomes Back to Reality 

For all the reasons mentioned above, linking outcomes to preparation programs was and 
remains a promising approach to improving educator preparation and the quality of the 
teaching force. But the initial enthusiasm for this work must be tempered by new research 
and ongoing implementation challenges. 


If there are no meaningful 
differences in teacher 
effectiveness between 
programs, then states, 
policymakers, and 
consumers have no 
reliable information to 
guide them. 


Recent studies from Missouri and Texas suggest that completer outcomes may not 
differentiate providers as distinctly as hoped. Researchers attempted, and failed, to use 
outcomes data to differentiate programs by the effectiveness of their completers. But 
there was no pattern in the types of teachers these programs produced. The implication 
can’t be overstated: If there are no meaningful differences in teacher effectiveness 
between programs, we’re back at square one. Outcomes work assumes that a teacher’s 
effectiveness is determined in large part by the preparation program she attended, and 
that preparation programs produce teachers who are at a consistent level of effectiveness. 
In other words, a great preparation program is one that produces highly effective teachers, 
a good preparation program produces effective teachers, and so on. But if that’s not the 
case, as the Missouri and Texas research suggests, then policymakers have no reliable 
information to guide them. States can’t reward or learn from great programs— those 
programs that produce the most effective teachers— if they can’t identify great programs. 
Without meaningful differences in outcomes among programs, school districts and 
potential candidates (the consumers in the preparation market) have little information 
about program quality. 


The Missouri and Texas findings also conflict with the research from Louisiana that 
fueled much of the original interest in outcomes. The researchers in Missouri used 
a different sampling method from the one employed in Louisiana, and they warn in 
their paper that the Louisiana research may have “overstated differences in teacher 
performance across preparation programs.” 36 Although Texas has a wide variety of 
preparation programs, including some run by school districts that require very little 
formal training, researchers were unable to find any meaningful differences in outcomes 
among 100 programs of all types and durations. 37 Combined, the Missouri and Texas 
findings suggest that policymakers should be cautious and temper their expectations for 
actionable results in their states. 
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Two other studies, out of North Carolina 38 and Washington State, 39 found some evidence 
of variation in preparation program outcomes, but suggest that the variations in outcomes 
within preparation programs are larger than those among them. The North Carolina study 
did not look at individual preparation programs, but it did compare student achievement 
across four groups of teachers: (1) traditionally certified teachers, alternatively certified 
teachers, and Teach For America corps members; (2) traditionally certified teachers from 
in-state and out-of-state programs; (3) traditionally certified teachers from in-state public 
and private universities; and (4) teachers who began teaching with undergraduate and 
graduate degrees. The researchers found small, statistically significant differences among 
certain groups of teachers for certain grades and subjects, but there was also a large 
variation in teacher performance within every preparation route, as shown in figure 1. The 
differences among category averages are distinguishable statistically, but the ranges show 
the wide disparities among teachers from the same program. 40 At best, the differences 
are small. At worst, the overlap among preparation routes makes any average differences 
meaningless for any individual candidate. This research, combined with the failure to 
replicate the Louisiana findings, is a warning that outcomes are not yet much better than 
inputs in identifying effective preparation programs. 

Implementation issues also bog down the outcomes-based approach as states confront 
difficult questions and trade-offs. To begin with, states may have access to their own data, 
but they're unable to continue collecting data on teachers who cross state lines. There’s 
also the question of how to interpret results. If a preparation program produces great 
teachers, does that mean the training program is great, or did the program just do a great 
job of selecting candidates? The cause may not matter to the school that hires the teacher, 
but it does matter to the state or preparation program trying to make sense of the results. 

States have also struggled with more technical questions. For example, what’s the right 
group size, or "n-size,” for reporting or accountability purposes? (In the context of teacher 
preparation, n-size is the minimum number of completers who can be included in a 
statistical analysis of program effectiveness.) The larger the n-size, the more confident 
states can be that the results truly reflect the program and are not just random noise. 

From a statistical perspective, a larger n-size is better, but it’s not always possible. Many 
programs and institutions produce a small number of completers every year, and only a 
fraction of those completers find jobs. Certain types of outcomes may also limit the sample 
of completers. (Only completers in specific grades and subjects, for example, will have 
student-learning data.) So setting a higher minimum n-size allows for statistical analysis and 
protects completer privacy, but it may incentivize providers to produce fewer completers in 
order to avoid public reporting. States deal with n-size issues by "rolling up” completers across 
multiple years or similar types of programs, or by linking outcomes to whole institutions 
rather than to each program. But an n-size that forces providers to roll up completer cohorts 
may undermine the statistical conclusions and reduce transparency. 41 
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High School Science Elementary Math 


Figure 1 


Distribution of Teacher Effectiveness in Elementary Mathematics 
and High School Science 
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Note: Figure depicts the distribution of teacher effectiveness (25th to 75th percentile) for nine teacher categories in elementary 
mathematics and high school science. 

Source: Gary T. Flenry et al., “Teacher Preparation Policies and Their Effect on Student Achievement," Education Finance and Policy 9 no. 
3 (Summer 2014), 264-303. 
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The biggest obstacle 
to outcomes-based 
accountability is that 
states are struggling 
to find meaningful 
differences in outcomes 
across programs for a 
range of variables. 


Implementation challenges are not just limited to n-sizes. Some states have struggled to 
quickly link data systems, while others lack staff members who can analyze the data. And all 
the technical and logistical questions are intertwined with complex politics. Because of the 
high-stakes nature of this work, for example, states convene committees of pre-k through 
12th grade, and higher, education stakeholders to decide which outcomes to use and for 
what purpose. These decisions can quickly become part of political battles over testing and 
teacher evaluations. 

Perhaps the biggest obstacle to outcomes-based accountability is that states are 
struggling to find meaningful differences in outcomes among programs for a range 
of variables, notjust student growth. For example, some states would preferto link 
preparation programs to the overall evaluation results of their completers, rather than 
only to test scores. But when those underlying evaluation results fail to show any 
meaningful differences among teachers, it is impossible to find meaningful differences 
among teacher preparation programs. If evaluations better assessed teacher quality, then 
we could more precisely differentiate teachers and, therefore, providers. But outcomes- 
based accountability relies on the quality of measures that go into it. 

All states doing this work planned to use outcomes in their accountability processes, but 
because of the challenges we’ve discussed, few actually do. Florida, Massachusetts, and 
Ohio are the only states we know of that link completer outcomes to program approval, 
and the degree to which they actually integrate outcomes into their approval processes 
varies. Florida is the most objective state. It set thresholds that differentiate program 
performance on four levels, and it developed a formula to roll up five years of outcomes 
data into one summative rating. On the other end of the spectrum, Ohio allows the 
chancellor to determine whether and how outcomes are used in the program approval 
process. Massachusetts is somewhere in the middle; the state created a rubric to guide how 
reviewers assess programs using input data, completer outcomes, and on-site visits, but the 
rubric does not set minimum requirements for completer outcomes. 42 

All other states doing this work only use outcomes for continuous improvement and public 
reporting. Some states made plans to include outcomes in their accountability processes— 
Rhode Island, for example, plans to include outcomes but has not finalized its rules for how 
it will do so. And Delaware plans to attach consequences to its completers’ outcomes in 
2016. 43 But it’s telling that even the pioneers in this work, Louisiana and Tennessee, are still 
figuring out how to best use outcomes for accountability. 
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Without the ability to observe and act on differences across programs, states must bet 
that transparency alone will shape supply and demand in the market for new teachers. In 
an ideal world, potential educators would use data to inform their enrollment decisions, 
and school districts would integrate outcomes data into their hiring practices. But these 
goals depend on reliable, easy-to-understand data. So far, states are struggling to make 
the data meaningful to the public. When it comes to making decisions about what data to 
report or howto report them, states often prioritize making data accessible to preparation 
programs over other potential purposes. Preparation programs should, indeed, be one of 
the main users of the data, but the idea that these programs will use the data to voluntarily 
improve their training efforts does not reflect the full theory of action behind the push for 
outcomes-based accountability. 
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Following the Existing Research 


A ll of this means that policymakers are still looking for the right way to identify 

effective teacher preparation and predict who will be an effective teacher. Nothing 
tried so far guarantees effective teachers. Yet there are breadcrumbs that could 
lead to a better approach. 

One starting point would be to embrace the research on initial teacher effectiveness. In one 
Los Angeles study, researchers found that a teacher’s initial classroom effectiveness better 
predicted her future classroom effectiveness than anything schools and districts used at 
the time. The study divided teachers into quartiles on the basis of effectiveness and tracked 
their students’ progress over three years. Students assigned to teachers in the top quartile 
outpaced peers with similar baseline scores and demographics, while students taught 
by bottom-quartile teachers fell behind their peers. It’s commonly noted that teachers 
improve with experience, and that was also true here. But even though all teachers improve 
on average, teachers tend to stay as relatively effective or ineffective as they were when 
they started teaching. 44 

A 2013 study of New York City teachers found similar results. On average, teachers 
continued to perform at the same relative performance level, and, while the lowest- 
performing teachers did improve over time, they never caught up to their more effective 
peers, even after several years of teaching. Worse, while all teachers tended to improve, 
the teachers who were the lowest performers from the outset never caught up, even to 
the average new teacher. Figure 2 illustrates these findings. 
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Figure 2 


Mean Value-Added Scores by Subject, Quintile of Initial Performance, 
and Years of Experience 


Math 


ELA 
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Note: Mean value-added (VA) scores, by subject (math or English language arts), quintile of initial performance, and years of experience 
for elementary school teachers with VA scores in at least first 5 years of teaching. Each year follows the same sample of teachers. 
Sample includes elementary and high school teachers with VA scores in at least their first five years of teaching. 

Source: Allison Atteberry, Susanna Loeb, and James Wyckoff, "Do First Impressions Matter? Predicting Early Career Teacher 
Effectiveness,” AERA Open 1 no. 4 (October-December 2015), 1-23. 
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Policymakers should be 
less focused on defining 
the perfect mix of 
requirements for incoming 
teachers and more focused 
on measuring and acting 
on teacher effectiveness in 
a teacher’s first years on 
the job. 


The results from a teacher’s first few years of teaching are much more informative than 
anything we know about a teacher before she begins teaching. Taken together, the 
type of preparation program teachers attend, their credentialing and SAT scores, the 
competitiveness of their undergraduate institution, their race and ethnicity, and their 
gender explain less than 3 percent of the variation in classroom effectiveness. In contrast, 
a teacher’s initial value-added results, measured in her first years on the job, explain 21 
percent of her future effectiveness in reading and 28 percent in math. In other words, with 
information collected in just a teacher’s first year on the job, districts know 7-9 times more 
about their employee than they did before the year began. 

The current research suggests that a teacher’s preparation is a relatively small factor in her 
eventual effectiveness. It’s possible that future research will challenge these findings. But if 
we take the current findings to their logical extent, they suggest policymakers should be less 
focused on defining the perfect mix of requirements for incoming teachers and more focused 
on measuring and acting on teacher effectiveness in a teacher’s first years on the job. 


To accomplish this goal, policymakers need to shift their priorities. They should roll back 
burdensome and ineffective teaching requirements, rethink licensure, create systems to 
make preparation-pathway data accessible to the public, and create the conditions for 
alternative pathways to teaching. We propose four strategies for ensuring that schools and 
students have access to the best teachers possible: 


1. Make it less risky to try teaching 

In the current preparation system, becoming a teacher is laden with risk. States force 
candidates to spend thousands of dollars and thousands of hours on fulfilling requirements 
that won’t make the candidates more effective teachers. When a candidate finally begins 
teaching, she’ll waste more time on questionable professional development. If she wants 
a salary increase, she’ll pay for, and sit through, graduate coursework to earn an advanced 
degree that won’t likely help her improve her practice. And these are just the risks 
associated with teacher training— there is also the challenging and high-stakes work, the 
generally low pay, and the lack of advancement opportunities that come with teaching 
itself. The result is a profession in which potential candidates’ actual and opportunity costs 
are extremely high. 

There are a number of ways to reduce the risks associated with teaching and make the 
profession more appealing. Reducing barriers to entry is one place to start. 

States should make it relatively easy for teacher candidates to enter the field. This may 
sound counterintuitive, but current barriers to entry do not guarantee quality teachers, 
and they keep out candidates who could be effective. States should stop writing the 
rules, down to specific minimum test scores and course requirements, for who gets to 
be a teacher. To impose a minimum level of quality, states should require candidates 
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to possess a bachelor’s degree from an accredited college or university. But with the 
possible exception of secondary math and science teachers— for whom research has found 
a definitive link between training and outcomes— states should not impose additional 
academic requirements on candidates. 

Rather than layering on requirements, states should get out of the way and rely on local 
school districts to hire the best candidate for the job. School districts should have the 
flexibility to screen candidates for majors, such as science or English, for certain teaching 
positions. While school districts face their own challenges in hiring and evaluating teachers, 
it’s not the state’s role to determine the requirements teachers must meet, certainly not 
when the evidence is inconclusive on the merits of those requirements. 

This would not eliminate the demand for existing teacher preparation programs. Candidates 
would be free to enroll in these programs, particularly if the programs could sell themselves 
as adding value. Districts may wish to continue hiring from these programs. But there’s 
no reason that states should require this monopoly on teacher training to continue. In 
opening the door to the profession in other ways, states should make a distinction between 
allowing someone to try teaching and allowing her to become a licensed teacher with full 
responsibility for student learning. See recommendation 2 for more on this distinction. 

Objections to this approach 

Untrained teachers shouldn’t be allowed to teach 

This argument is an emotional appeal that hinges on the assumption that today’s 
barriers to entry prevent unqualified people from teaching our children. But in 
reality, today’s barriers are little more than veils, not guarantees of quality. We 
don’t know the right combination of inputs for training effective teachers, and it’s 
impossible to guarantee that all new teachers will be good teachers, regardless of 
the type of training they receive. 

This objection also assumes that with the right training, an incoming teacher 
should be given the same responsibility as veterans with many more years of 
experience. That approach, used in most schools across the country, does not 
align with how teachers learn on the job. Rather than trying to fit new teachers 
into a system they’re not prepared for, states should change an incoming 
teacher’s responsibilities so she does not have direct and sole responsibility for 
educating children until she has a record of effectiveness. The current system 
blindly assumes that all incoming teachers have that track record, but they don’t. 

Allowing anyone to become a teacher will lower the status of the teaching profession 
Let us be the first to admit it: There could be ramifications if we lower the barriers 
to teaching. If those barriers shape how society values the teaching profession— if 
the barriers give teachers status— society’s perception of teachers may change. 
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But this is a slippery objection, because there's no objective measure of status or 
what defines it. It’s also unclear which factors are the most important determinants 
of status— training requirements, salaries, the responsibilities of the job? 

For certain people, status is only conferred on professions like law or medicine 
that have strong barriers to entry. This camp tends to ignore the fact that law 
and medicine are much smaller professions with much higher salaries. Teaching 
is by far the largest profession made up of college-educated individuals, and any 
questions about barriers to entry must take into account the fact that schools 
simply need to hire lots of people. And states choosing to layer on additional 
requirements may drive away talented applicants who might otherwise be 
willing to try teaching. Ultimately, a profession’s status is marked by the quality 
of its practitioners, not just how they came to the field. 

To be clear, there are trade-offs inherent in this question. But status is not 
reason enough to maintain burdensome barriers to entry, particularly if those 
barriers are meaningless. 

2. Give schools and districts, not preparation programs, 
responsibility for recommending a candidate for licensure, 
and require that recommendation to be based on a track 
record of effectiveness 

In the current system, once a candidate meets the state requirements, her teacher 
preparation program recommends her for licensure. This is a flawed arrangement. Most 
preparation programs make recommendations on the basis of the completer’s academic 
performance and a limited amount of (perhaps undersupervised) student teaching 
experience. But, as noted above, there’s no guarantee that these experiences create 
a teacher who is prepared to be effective on Day One. Moreover, the current system 
encourages schools to treat all licensed teachers as interchangeable once they enter the 
classroom, with identical workloads, evaluation systems, and development opportunities. 

A better system would base licensure on actual candidate performance. To that end, states 
should strip the power to grant teacher licenses from preparation programs and give that 
responsibility to the districts where candidates teach. K-12 school leaders are the closest 
observers of actual candidate performance— much closer observers than preparation 
programs are— and they have access to data that preparation programs don’t, including 
student-level learning results. Instead of relying on a checklist of meaningless inputs, 
states should allow individual candidates to apply for full-time licensure only after they 
have a track record of performance. States should set their own rules for what a track 
record of performance looks like, but at the very least, licensure decisions should require 
observations of candidates’ performance in real-time classroom settings and demonstrated 
effectiveness in supporting students’ academic growth. 
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In this scenario, candidates would enter the classroom in a relatively low-stakes 
environment. Unlike today’s typical teacher, who, for her first full-time teaching job, is 
thrust in front of a full class of students at the beginning of the school year, a candidate 
should have opportunities for lower-stakes interactions with students, such as summer 
school and after-school and tutoring programs. Over time, the candidate would play a 
substantial role in the classroom, to the school’s benefit and her own. There are various 
models districts could follow, but as far as the state is concerned, a candidate should 
become an official "teacher of record” only after she has demonstrated her effectiveness 
and earned her license. This puts the focus on each teacher’s accomplishments rather than 
on her credentials or her preparation pathway. Programs such as the Teacher Advancement 
Program , 45 the Opportunity Culture initiative , 46 and Ohio’s Resident Educator Program 47 
focus on teacher accomplishments. TNTP started down a similar path almost four years 
ago . 48 

Withholding licensure has three effects: It ensures that only the most effective candidates 
earn full teacher licenses, it increases the prestige associated with becoming a fully 
licensed teacher, and it encourages ineffective candidates to self-select out of the 
profession. To be clear, this process does not rely on districts firing ineffective candidates. 
Recent research from New York City suggests that merely identifying low-performing 
teachers and delaying decisions on their tenure can encourage weaker teachers to leave 
the profession . 49 Districts should determine their own processes for making decisions— for 
example, candidates who do not earn their licenses could stay on as aides, small-group 
instructors, or co-teachers. But it is crucial that, as we lower the barriers to teaching, we 
raise the status and the standards for becoming a fully licensed teacher. 

Objections to this approach 

There’s no objective way of knowing who’s a great teacher 
This objection suggests it’s simply impossible to quantify a teacher’s 
contributions to the classroom, and so it would be impossible for teachers to 
ever demonstrate a record of effectiveness. We think this line of reasoning 
is wrong— high-quality observations, measures of student growth, and other 
indicators can be combined into a reliable estimate of teacher quality. It’s also 
easy to turn this objection back on the speaker. If it’s impossible to know who 
is a good teacher, it’s also impossible to put in place a training program that will 
ensure that candidates are effective. After all, how would we ever know that the 
program is working? 

This objection has implications for the entire teaching profession, not just 
for teacher preparation. If we need better measures of teacher quality, those 
measures are even more important for current teachers’ concerns— evaluation, 
compensation, professional development, retention decisions— than for 
teacher preparation. 
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Schools would need to change before preparation can change 
It's tempting to suggest that everything about how schools are run must change 
in order to change the way teachers are prepared. But while it would behoove 
states and schools to do business in a new way, it isn’t required. 

In fact, many places are already working toward our vision of a diverse network 
of preparation providers. Twenty-five years ago, there were no alternative-route 
preparation programs, and all teachers were trained in traditional, university- 
based programs. Today, one-quarter of new teachers arrive at the profession 
through alternative routes, and the largest teacher preparation program in the 
country is Teach For America. 50 TFA provides incoming candidates with just six 
weeks of training, yet its teachers are no less effective than veteran teachers, 
and they significantly outperform other new teachers. 51 In Florida, teachers 
entering the profession through a self-paced, online learning program that can 
be completed in a few months outperform other new teachers in the state. 52 And 
some districts and charter-management organizations already operate their 
own training programs. Without changing any other policies, states should learn 
from these examples and slowly offer new routes to teaching. If the outcomes 
from these new approaches look similar to today’s outcomes, states could feel 
confident about expanding the new programs. 

3. Measure and publicize results 

There’s a large market for new teachers— school districts hire between 75,000 and 90,000 
newly trained teachers a year— but that market has been plagued by a lack of reliable 
information. Candidates have little information about which teacher preparation programs 
will most likely lead to a job or help them become effective teachers once they enter the 
profession. When hiring a new teacher, districts typically look at only the candidate’s GPA, 
alma mater, and letters of recommendation. Some districts also apply their own screening 
tools, and a few districts ask candidates to teach a sample lesson. 

Districts are partly to blame for this market failure. All districts could ask for more 
information from candidates or require them to demonstrate their teaching skills. And 
even in the current environment, there’s nothing stopping districts from hiring teachers 
before the school year begins, such as for summer school, so they can start off in a low- 
stakes environment. Districts could also do a much better job of training new teachers and 
ensuring that they receive professional development tailored to their needs. 

But states are mostly to blame for the lack of reliable information about the performance 
of preparation programs. States are the only entities that could have enough data to 
objectively assess candidate performance, placement, and retention. Candidates will 
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never have this information unless the states collect and provide it. Districts will never see 
beyond their own hiring practices unless their state collects information from all schools 
and aggregates the results. 

In the world we envision, states would do a much better job of collecting and reporting 
on this information. They would collect and publish program-level data on teacher 
effectiveness, retention, placement, and years to licensure. And they would invest 
substantial time and effort in making the data accessible to the public. 

Here, the word “program” is not limited to a traditional preparation program at a college or 
university. It includes other providers in the education field, districts that offer their own 
training, and even "no program” candidates who enter the classroom as teachers without 
attending any program at all. 

By opening the door to new providers, states would be introducing new competition to an 
old market. Existing providers would be required to compete with alternative programs to 
show that the time and expense they require of candidates are worthwhile. With new data 
tracking outcomes, candidates would be equipped with reliable information about their 
options and would be free to choose their own pathways. Districts would not be required 
to select candidates from less intensive preparation programs; they'd be free to choose 
their own candidates, just as they do now. The majority of candidates could continue going 
to existing preparation programs, but those programs would have to compete for market 
share, rather than corner the market by default. 

In the short term, there’s reason to be cautious about what these outcomes can tell 
us. But in the long term, collecting and reporting outcomes information is the only way 
teacher preparation can ever improve. Without a systematic way to track outcomes, we 
cannot know what makes an effective teacher and what does and does not matter for 
kids. If states track outcomes, they have a path forward. If they don’t, they’re left blindly 
wandering from input to input. 

Objections to this approach 

If completer outcomes data aren’t meaningful, it’s not worth the time and effort 
to track and publish them 

There’s been very little evidence that preparation programs can be distinguished 
from one another on the basis of completer outcomes for things like student- 
growth scores or teacher-evaluation ratings. This objection assumes that, given 
the lack of actionable information, there’s no value in measuring and publishing 
completer outcomes. 
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While it’s true that the data may not yet be meaningful from a statistical 
perspective, collecting, analyzing, and publishing outcomes data are the only 
ways preparation programs can ever know if they’re improving. Measuring and 
publishing completer outcomes data bolsters programs’ continuous improvement 
efforts, giving them deeper insights into the information they already have. 
Working with the data also builds technical capacity and allows researchers to 
study the policies of teacher preparation programs and gauge their effectiveness 
overtime. And in the absence of rigorous state accountability systems, public 
completer outcomes data give potential candidates and employers useful 
information that they can use to choose programs and hire teachers. 

Despite the evidence from Missouri and Texas, it’s possible that research in 
other states will find meaningful differences in program quality. That hope 
doesn’t justify blindly crafting an accountability system around outcomes, but 
it is enough reason to require all states to measure and publish completer data. 
We also remain open to the possibility that future assessments, whether the 
new Common Core-aligned assessments or another iteration, will reveal more 
variability among teachers and, consequently, preparation programs. 

Finally, although some outcomes data may not give states the information 
they need to change their path to the teaching profession, the data may 
provide useful information to schools and future teachers. For example, 
prospective teachers today have little information about which programs are 
most successful at placing teachers in full-time teaching positions and which 
preparation programs graduate teachers who stay in the profession. Research 
from Washington State found that teachers from some preparation programs 
had retention rates equal to four to five percentage points higher than their 
peers. This sort of information may be useful to both prospective teachers and 
their future employers . 53 

4. Unpack the black box of good teaching 

We don’t know how to identify or train good teachers. To make matters worse, as a field 
we tend to recklessly embrace faddish “best practices,” whether or not there’s research 
to back them up. When a new idea comes along, it’s reasonable to first try it on a small 
scale, measure the results, and then scale it up only if the research says it’s effective. 

But that’s not what states have done with teacher preparation. A number of ineffective 
requirements— from higher minimum GPAs to more clinical coursework hours to better 
teacher-performance assessments— started off as good ideas that states and programs 
codified as policy before the ideas were sufficiently tested. Instead, states, the federal 
government, and private philanthropy organizations should invest strategically in research 
on what makes a good teacher and use that research to make policy. 
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We see several possibilities for research. One is analyzing existing measures and inputs 
to see whether they matter for a wider range of outcomes. Most of the studies cited in 
this paper look at whether various inputs lead to higher student test scores. But these 
analyses use results from state tests that measure relatively low-level skills. By using 
new assessments that measures higher-order thinking skills, it's possible we will gain new 
insights into teacher preparation programs. The same might be true if researchers look at a 
wider range of outcomes, rather than just test scores. 

To learn more about teacher training, we also need to encourage more variability in the 
programs themselves. Rather than try to standardize teacher training before we know 
what works, we need to do the opposite: more experimentation and more research 
about how the unique features of programs affect outcomes. Researchers should try 
to penetrate the black box of what makes a great teacher and how districts can select 
or train for those characteristics. A study of Teach For America, for example, found 
that its screening mechanism had some predictive power, particularly in terms of how 
much "grit” or “stick-to-itiveness” the candidates demonstrated. 54 If the vast majority of 
differences manifest within individual preparation programs, rather than across them, it 
would be instructive to understand why a single program produces teachers with very 
different levels of effectiveness. 

Objections to this approach 

Research is slow, and it doesn’t trickle down to schools 
One criticism of research is that it’s too expensive and time consuming, and 
that policymakers don’t use it when making decisions. The current teacher 
preparation requirements justify this objection: States have implemented a wide 
range of requirements for candidates and preparation programs, despite the lack 
of research linking those requirements with teacher effectiveness. 

We are also concerned about the expense, timeliness, and usability of research. 
But without more and better research, policymakers will never have good 
information on what levers they can pull to improve teacher performance. After 
all, the money spent on teachers is the largest expenditure in education, and for 
schools, teachers are the most important, controllable factor in shaping student 
growth. In the last year alone, schools around the country spent $320 billion on 
teachers. 55 It makes sense to invest in maximizing that investment and finding 
ways to help educators thrive. 
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Embracing This Agenda 


C urrent methods of teacher training rely on imposing barriers to the profession, but 
there’s little evidence that the system is worth the investment of time or money. 
Policymakers have pushed for these barriers to entry in the mistaken belief that 
the status of a professional is determined by the length of her training. They’ve come up 
with new ways to make it more difficult and more expensive to become a teacher. Our 
proposal would stop this downward spiral. 


That’s not to say that ours is the only solution. In fact, we will happily be proven wrong. 

If research finds that new, different assessments could predict a teacher’s future 
effectiveness, for example, we would gladly endorse it as a method for screening 
candidates. If the Common Core-aligned assessments uncover consistent variations 
among preparation programs, it will be easier to know howto improve teacher preparation 
pathways. But that’s not what’s happening right now. 


As for the objections to the vision we’ve outlined here, we acknowledge that these 
obstacles are real. They are not prohibitive, however, and not an excuse to be content with 
the status quo. 
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